Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Corpus-Concordance-Database-VARBRUL'

Identifieur interne : 000529 ( Main/Exploration ); précédent : 000528; suivant : 000530

Corpus-Concordance-Database-VARBRUL'

Auteurs : John M. Kirk [Royaume-Uni]

Source :

RBID : ISTEX:BB91AA8C5A07C6FB65D07C0A1793123181B9E6FB

Abstract

Although concordances are in widespread use and have become the main output format of corpus analysis, they do little more than rearrange the selected data as a special kind of list. By yielding enough preceding and following context, however, concordances are usually sufficient for the purposes of classification and analysis. As items of high frequency have different senses, perform different functions, occur in differently constructed environments, and are thus used variably, classification is a major part of linguistic analysis. Up until now, however, it has. not been easy to store classificatory encodings alongside the concordance data, with a view to further exploitation of these encoded classifications. One solution is the importation of concordances into a database, where additional fields can then be created for mnemonic classificatory encodings, for further sorting on the basis of these encodings. This paper shows how this solution can be implemented in practice. It shows how concordances can come to be used in an ‘intelligent’ way as the basis of linguistic analysis; it also provides a practical tip for the use of the main software package for the analysis of interacting variables: VARBRUL For its quantitative and statistical correlation of co-occuring variants of different internal or external variables, VARBRUL depends utterly on a tokens file of encodings about the behaviour of each item. This paper shows how this tokens file can now be created by copying the classificatory encodings directly from the database.

Url:
DOI: 10.1093/llc/9.4.259


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Corpus-Concordance-Database-VARBRUL'</title>
<author wicri:is="90%">
<name sortKey="Kirk, John M" sort="Kirk, John M" uniqKey="Kirk J" first="John M." last="Kirk">John M. Kirk</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:BB91AA8C5A07C6FB65D07C0A1793123181B9E6FB</idno>
<date when="1994" year="1994">1994</date>
<idno type="doi">10.1093/llc/9.4.259</idno>
<idno type="url">https://api.istex.fr/document/BB91AA8C5A07C6FB65D07C0A1793123181B9E6FB/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000461</idno>
<idno type="wicri:Area/Istex/Curation">000461</idno>
<idno type="wicri:Area/Istex/Checkpoint">000431</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000431</idno>
<idno type="wicri:doubleKey">0268-1145:1994:Kirk J:corpus:concordance:database</idno>
<idno type="wicri:Area/Main/Merge">000568</idno>
<idno type="wicri:Area/Main/Curation">000529</idno>
<idno type="wicri:Area/Main/Exploration">000529</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Corpus-Concordance-Database-VARBRUL'</title>
<author wicri:is="90%">
<name sortKey="Kirk, John M" sort="Kirk, John M" uniqKey="Kirk J" first="John M." last="Kirk">John M. Kirk</name>
<affiliation wicri:level="2">
<country>Royaume-Uni</country>
<placeName>
<region type="country">Irlande du Nord</region>
</placeName>
<wicri:cityArea>The Queen's University of Belfast</wicri:cityArea>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="1994">1994</date>
<biblScope unit="volume">9</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="259">259</biblScope>
<biblScope unit="page" to="266">266</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">BB91AA8C5A07C6FB65D07C0A1793123181B9E6FB</idno>
<idno type="DOI">10.1093/llc/9.4.259</idno>
<idno type="ArticleID">9.4.259</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract">Although concordances are in widespread use and have become the main output format of corpus analysis, they do little more than rearrange the selected data as a special kind of list. By yielding enough preceding and following context, however, concordances are usually sufficient for the purposes of classification and analysis. As items of high frequency have different senses, perform different functions, occur in differently constructed environments, and are thus used variably, classification is a major part of linguistic analysis. Up until now, however, it has. not been easy to store classificatory encodings alongside the concordance data, with a view to further exploitation of these encoded classifications. One solution is the importation of concordances into a database, where additional fields can then be created for mnemonic classificatory encodings, for further sorting on the basis of these encodings. This paper shows how this solution can be implemented in practice. It shows how concordances can come to be used in an ‘intelligent’ way as the basis of linguistic analysis; it also provides a practical tip for the use of the main software package for the analysis of interacting variables: VARBRUL For its quantitative and statistical correlation of co-occuring variants of different internal or external variables, VARBRUL depends utterly on a tokens file of encodings about the behaviour of each item. This paper shows how this tokens file can now be created by copying the classificatory encodings directly from the database.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Royaume-Uni</li>
</country>
<region>
<li>Irlande du Nord</li>
</region>
</list>
<tree>
<country name="Royaume-Uni">
<region name="Irlande du Nord">
<name sortKey="Kirk, John M" sort="Kirk, John M" uniqKey="Kirk J" first="John M." last="Kirk">John M. Kirk</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000529 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000529 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:BB91AA8C5A07C6FB65D07C0A1793123181B9E6FB
   |texte=   Corpus-Concordance-Database-VARBRUL'
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024